Overview

Dataset statistics

Number of variables43
Number of observations25191
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.3 MiB
Average record size in memory344.0 B

Variable types

NUM29
BOOL8
CAT6

Warnings

num_outbound_cmds has constant value "25191" Constant
is_host_login has constant value "25191" Constant
service has a high cardinality: 66 distinct values High cardinality
num_root is highly correlated with num_compromisedHigh correlation
num_compromised is highly correlated with num_rootHigh correlation
srv_serror_rate is highly correlated with serror_rate and 2 other fieldsHigh correlation
serror_rate is highly correlated with srv_serror_rate and 2 other fieldsHigh correlation
srv_rerror_rate is highly correlated with rerror_rate and 2 other fieldsHigh correlation
rerror_rate is highly correlated with srv_rerror_rate and 2 other fieldsHigh correlation
dst_host_serror_rate is highly correlated with serror_rate and 2 other fieldsHigh correlation
dst_host_srv_serror_rate is highly correlated with serror_rate and 2 other fieldsHigh correlation
dst_host_rerror_rate is highly correlated with rerror_rate and 2 other fieldsHigh correlation
dst_host_srv_rerror_rate is highly correlated with rerror_rate and 2 other fieldsHigh correlation
service is highly correlated with protocol_typeHigh correlation
protocol_type is highly correlated with serviceHigh correlation
attack is highly correlated with wrong_fragmentHigh correlation
wrong_fragment is highly correlated with attackHigh correlation
src_bytes is highly skewed (γ1 = 157.5554145) Skewed
dst_bytes is highly skewed (γ1 = 54.77648949) Skewed
num_failed_logins is highly skewed (γ1 = 53.31154923) Skewed
num_compromised is highly skewed (γ1 = 62.18985248) Skewed
num_root is highly skewed (γ1 = 62.31982588) Skewed
num_file_creations is highly skewed (γ1 = 52.14065242) Skewed
num_access_files is highly skewed (γ1 = 41.75193449) Skewed
duration has 23167 (92.0%) zeros Zeros
src_bytes has 9866 (39.2%) zeros Zeros
dst_bytes has 13573 (53.9%) zeros Zeros
hot has 24671 (97.9%) zeros Zeros
num_failed_logins has 25168 (99.9%) zeros Zeros
num_compromised has 24919 (98.9%) zeros Zeros
num_root has 25057 (99.5%) zeros Zeros
num_file_creations has 25125 (99.7%) zeros Zeros
num_access_files has 25112 (99.7%) zeros Zeros
serror_rate has 17328 (68.8%) zeros Zeros
srv_serror_rate has 17707 (70.3%) zeros Zeros
rerror_rate has 21984 (87.3%) zeros Zeros
srv_rerror_rate has 21958 (87.2%) zeros Zeros
same_srv_rate has 543 (2.2%) zeros Zeros
diff_srv_rate has 15244 (60.5%) zeros Zeros
srv_diff_host_rate has 19516 (77.5%) zeros Zeros
dst_host_same_srv_rate has 1379 (5.5%) zeros Zeros
dst_host_diff_srv_rate has 9343 (37.1%) zeros Zeros
dst_host_same_src_port_rate has 12673 (50.3%) zeros Zeros
dst_host_srv_diff_host_rate has 17386 (69.0%) zeros Zeros
dst_host_serror_rate has 16220 (64.4%) zeros Zeros
dst_host_srv_serror_rate has 17004 (67.5%) zeros Zeros
dst_host_rerror_rate has 20688 (82.1%) zeros Zeros
dst_host_srv_rerror_rate has 21348 (84.7%) zeros Zeros

Reproduction

Analysis started2021-11-11 16:12:01.151985
Analysis finished2021-11-11 16:13:37.392303
Duration1 minute and 36.24 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

duration
Real number (ℝ≥0)

ZEROS

Distinct758
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean305.0662141
Minimum0
Maximum42862
Zeros23167
Zeros (%)92.0%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile5
Maximum42862
Range42862
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2686.608278
Coefficient of variation (CV)8.806639849
Kurtosis146.6950284
Mean305.0662141
Median Absolute Deviation (MAD)0
Skewness11.53240986
Sum7684923
Variance7217864.038
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
02316792.0%
 
13741.5%
 
21650.7%
 
31020.4%
 
4750.3%
 
5620.2%
 
6410.2%
 
27400.2%
 
28380.2%
 
7260.1%
 
Other values (748)11014.4%
 
ValueCountFrequency (%) 
02316792.0%
 
13741.5%
 
21650.7%
 
31020.4%
 
4750.3%
 
ValueCountFrequency (%) 
428621< 0.1%
 
426581< 0.1%
 
426361< 0.1%
 
424701< 0.1%
 
422601< 0.1%
 

protocol_type
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
tcp
20525 
udp
3011 
icmp
 
1655
ValueCountFrequency (%) 
tcp2052581.5%
 
udp301112.0%
 
icmp16556.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length4
Median length3
Mean length3.065698067
Min length3

service
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct66
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
http
8003 
private
4351 
domain_u
1820 
smtp
1449 
ftp_data
1395 
Other values (61)
8173 
ValueCountFrequency (%) 
http800331.8%
 
private435117.3%
 
domain_u18207.2%
 
smtp14495.8%
 
ftp_data13955.5%
 
eco_i9093.6%
 
other8583.4%
 
ecr_i6132.4%
 
telnet4831.9%
 
finger3661.5%
 
Other values (56)494419.6%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length11
Median length5
Mean length5.473581835
Min length3

flag
Categorical

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
SF
14972 
S0
7009 
REJ
2216 
RSTR
 
497
RSTO
 
304
Other values (6)
 
193
ValueCountFrequency (%) 
SF1497259.4%
 
S0700927.8%
 
REJ22168.8%
 
RSTR4972.0%
 
RSTO3041.2%
 
S1880.3%
 
SH430.2%
 
S2210.1%
 
RSTOS0210.1%
 
S3150.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.155095074
Min length2

src_bytes
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct1665
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24331.57457
Minimum0
Maximum381709090
Zeros9866
Zeros (%)39.2%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median44
Q3279
95-th percentile1486.5
Maximum381709090
Range381709090
Interquartile range (IQR)279

Descriptive statistics

Standard deviation2410853.249
Coefficient of variation (CV)99.08332247
Kurtosis24943.62413
Mean24331.57457
Median Absolute Deviation (MAD)44
Skewness157.5554145
Sum612936695
Variance5.81221339e+12
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0986639.2%
 
87382.9%
 
14801.9%
 
444671.9%
 
454161.7%
 
10323901.5%
 
462841.1%
 
432310.9%
 
1472100.8%
 
1052040.8%
 
Other values (1655)1190547.3%
 
ValueCountFrequency (%) 
0986639.2%
 
14801.9%
 
41< 0.1%
 
54< 0.1%
 
6320.1%
 
ValueCountFrequency (%) 
3817090901< 0.1%
 
76658761< 0.1%
 
72485521< 0.1%
 
51356783< 0.1%
 
51338768< 0.1%
 

dst_bytes
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct3922
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3491.985789
Minimum0
Maximum5151385
Zeros13573
Zeros (%)53.9%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3530.5
95-th percentile8314
Maximum5151385
Range5151385
Interquartile range (IQR)530.5

Descriptive statistics

Standard deviation88832.4788
Coefficient of variation (CV)25.43895771
Kurtosis3130.04834
Mean3491.985789
Median Absolute Deviation (MAD)0
Skewness54.77648949
Sum87966614
Variance7891209290
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01357353.9%
 
1053091.2%
 
83141750.7%
 
441150.5%
 
421050.4%
 
3301050.4%
 
3321030.4%
 
331970.4%
 
4940.4%
 
329880.3%
 
Other values (3912)1042741.4%
 
ValueCountFrequency (%) 
01357353.9%
 
16< 0.1%
 
4940.4%
 
157< 0.1%
 
177< 0.1%
 
ValueCountFrequency (%) 
51513851< 0.1%
 
51508361< 0.1%
 
51507721< 0.1%
 
51501801< 0.1%
 
51495331< 0.1%
 

land
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25189 
1
 
2
ValueCountFrequency (%) 
025189> 99.9%
 
12< 0.1%
 

wrong_fragment
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
24967 
3
 
187
1
 
37
ValueCountFrequency (%) 
02496799.1%
 
31870.7%
 
1370.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

urgent
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25190 
1
 
1
ValueCountFrequency (%) 
025190> 99.9%
 
11< 0.1%
 

hot
Real number (ℝ≥0)

ZEROS

Distinct22
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1980469215
Minimum0
Maximum77
Zeros24671
Zeros (%)97.9%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum77
Range77
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.154244164
Coefficient of variation (CV)10.87744332
Kurtosis213.6893054
Mean0.1980469215
Median Absolute Deviation (MAD)0
Skewness13.58926356
Sum4989
Variance4.64076792
MonotocityNot monotonic
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%) 
02467197.9%
 
22000.8%
 
1780.3%
 
30550.2%
 
28520.2%
 
4370.1%
 
6260.1%
 
5170.1%
 
22130.1%
 
249< 0.1%
 
Other values (12)330.1%
 
ValueCountFrequency (%) 
02467197.9%
 
1780.3%
 
22000.8%
 
37< 0.1%
 
4370.1%
 
ValueCountFrequency (%) 
771< 0.1%
 
30550.2%
 
28520.2%
 
251< 0.1%
 
249< 0.1%
 

num_failed_logins
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.001190901512
Minimum0
Maximum4
Zeros25168
Zeros (%)99.9%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum4
Range4
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.04541908114
Coefficient of variation (CV)38.13840244
Kurtosis3611.778831
Mean0.001190901512
Median Absolute Deviation (MAD)0
Skewness53.31154923
Sum30
Variance0.002062892932
MonotocityNot monotonic
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%) 
02516899.9%
 
1190.1%
 
22< 0.1%
 
41< 0.1%
 
31< 0.1%
 
ValueCountFrequency (%) 
02516899.9%
 
1190.1%
 
22< 0.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
41< 0.1%
 
31< 0.1%
 
22< 0.1%
 
1190.1%
 
02516899.9%
 

logged_in
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
15246 
1
9945 
ValueCountFrequency (%) 
01524660.5%
 
1994539.5%
 

num_compromised
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct28
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.227859156
Minimum0
Maximum884
Zeros24919
Zeros (%)98.9%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum884
Range884
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10.41755871
Coefficient of variation (CV)45.71928946
Kurtosis4313.604413
Mean0.227859156
Median Absolute Deviation (MAD)0
Skewness62.18985248
Sum5740
Variance108.5255295
MonotocityNot monotonic
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%) 
02491998.9%
 
11940.8%
 
2210.1%
 
4130.1%
 
68< 0.1%
 
37< 0.1%
 
55< 0.1%
 
122< 0.1%
 
1512< 0.1%
 
72< 0.1%
 
Other values (18)180.1%
 
ValueCountFrequency (%) 
02491998.9%
 
11940.8%
 
2210.1%
 
37< 0.1%
 
4130.1%
 
ValueCountFrequency (%) 
8841< 0.1%
 
7891< 0.1%
 
5581< 0.1%
 
5201< 0.1%
 
4621< 0.1%
 

root_shell
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25152 
1
 
39
ValueCountFrequency (%) 
02515299.8%
 
1390.2%
 

su_attempted
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25170 
2
 
13
1
 
8
ValueCountFrequency (%) 
02517099.9%
 
2130.1%
 
18< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

num_root
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct28
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2498511373
Minimum0
Maximum975
Zeros25057
Zeros (%)99.5%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum975
Range975
Interquartile range (IQR)0

Descriptive statistics

Standard deviation11.50106997
Coefficient of variation (CV)46.03168948
Kurtosis4315.596023
Mean0.2498511373
Median Absolute Deviation (MAD)0
Skewness62.31982588
Sum6294
Variance132.2746104
MonotocityNot monotonic
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%) 
02505799.5%
 
1470.2%
 
9240.1%
 
6230.1%
 
210< 0.1%
 
56< 0.1%
 
32< 0.1%
 
42< 0.1%
 
911< 0.1%
 
5121< 0.1%
 
Other values (18)180.1%
 
ValueCountFrequency (%) 
02505799.5%
 
1470.2%
 
210< 0.1%
 
32< 0.1%
 
42< 0.1%
 
ValueCountFrequency (%) 
9751< 0.1%
 
8671< 0.1%
 
6291< 0.1%
 
5721< 0.1%
 
5121< 0.1%
 

num_file_creations
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct20
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.01472748204
Minimum0
Maximum40
Zeros25125
Zeros (%)99.7%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum40
Range40
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5296128041
Coefficient of variation (CV)35.96085215
Kurtosis3158.079648
Mean0.01472748204
Median Absolute Deviation (MAD)0
Skewness52.14065242
Sum371
Variance0.2804897223
MonotocityNot monotonic
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%) 
02512599.7%
 
1370.1%
 
27< 0.1%
 
43< 0.1%
 
82< 0.1%
 
182< 0.1%
 
52< 0.1%
 
381< 0.1%
 
61< 0.1%
 
211< 0.1%
 
Other values (10)10< 0.1%
 
ValueCountFrequency (%) 
02512599.7%
 
1370.1%
 
27< 0.1%
 
31< 0.1%
 
43< 0.1%
 
ValueCountFrequency (%) 
401< 0.1%
 
381< 0.1%
 
291< 0.1%
 
211< 0.1%
 
201< 0.1%
 

num_shells
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25182 
1
 
9
ValueCountFrequency (%) 
025182> 99.9%
 
19< 0.1%
 

num_access_files
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004326942162
Minimum0
Maximum8
Zeros25112
Zeros (%)99.7%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.0985259286
Coefficient of variation (CV)22.7703364
Kurtosis2499.808557
Mean0.004326942162
Median Absolute Deviation (MAD)0
Skewness41.75193449
Sum109
Variance0.009707358607
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
02511299.7%
 
1650.3%
 
28< 0.1%
 
52< 0.1%
 
32< 0.1%
 
81< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
02511299.7%
 
1650.3%
 
28< 0.1%
 
32< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
81< 0.1%
 
52< 0.1%
 
41< 0.1%
 
32< 0.1%
 
28< 0.1%
 

num_outbound_cmds
Boolean

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25191 
ValueCountFrequency (%) 
025191100.0%
 

is_host_login
Boolean

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
25191 
ValueCountFrequency (%) 
025191100.0%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
0
24961 
1
 
230
ValueCountFrequency (%) 
02496199.1%
 
12300.9%
 

_count
Real number (ℝ≥0)

Distinct466
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.59445834
Minimum1
Maximum511
Zeros0
Zeros (%)0.0%
Memory size196.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median14
Q3144
95-th percentile286
Maximum511
Range510
Interquartile range (IQR)142

Descriptive statistics

Standard deviation114.6745463
Coefficient of variation (CV)1.355579887
Kurtosis1.977842794
Mean84.59445834
Median Absolute Deviation (MAD)13
Skewness1.503678192
Sum2131019
Variance13150.25157
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1551921.9%
 
219337.7%
 
37693.1%
 
46962.8%
 
55972.4%
 
64771.9%
 
74621.8%
 
84041.6%
 
93421.4%
 
113171.3%
 
Other values (456)1367554.3%
 
ValueCountFrequency (%) 
1551921.9%
 
219337.7%
 
37693.1%
 
46962.8%
 
55972.4%
 
ValueCountFrequency (%) 
5112931.2%
 
510580.2%
 
509490.2%
 
5086< 0.1%
 
5071< 0.1%
 

srv_count
Real number (ℝ≥0)

Distinct414
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.69977373
Minimum1
Maximum511
Zeros0
Zeros (%)0.0%
Memory size196.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median8
Q318
95-th percentile157
Maximum511
Range510
Interquartile range (IQR)16

Descriptive statistics

Standard deviation72.46949952
Coefficient of variation (CV)2.616248791
Kurtosis24.39561696
Mean27.69977373
Median Absolute Deviation (MAD)7
Skewness4.707424196
Sum697785
Variance5251.82836
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1508020.2%
 
2253710.1%
 
312234.9%
 
410864.3%
 
59133.6%
 
68493.4%
 
78003.2%
 
87513.0%
 
97182.9%
 
116882.7%
 
Other values (404)1054641.9%
 
ValueCountFrequency (%) 
1508020.2%
 
2253710.1%
 
312234.9%
 
410864.3%
 
59133.6%
 
ValueCountFrequency (%) 
5112000.8%
 
510360.1%
 
5096< 0.1%
 
5082< 0.1%
 
5001< 0.1%
 

serror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct70
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2863490929
Minimum0
Maximum1
Zeros17328
Zeros (%)68.8%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.4473175631
Coefficient of variation (CV)1.562140667
Kurtosis-1.074506772
Mean0.2863490929
Median Absolute Deviation (MAD)0
Skewness0.9525852523
Sum7213.42
Variance0.2000930022
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01732868.8%
 
1694127.6%
 
0.51220.5%
 
0.07530.2%
 
0.05500.2%
 
0.06500.2%
 
0.33480.2%
 
0.08460.2%
 
0.01460.2%
 
0.25430.2%
 
Other values (60)4641.8%
 
ValueCountFrequency (%) 
01732868.8%
 
0.01460.2%
 
0.02160.1%
 
0.03310.1%
 
0.04290.1%
 
ValueCountFrequency (%) 
1694127.6%
 
0.99410.2%
 
0.9812< 0.1%
 
0.97160.1%
 
0.967< 0.1%
 

srv_serror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct56
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2837735699
Minimum0
Maximum1
Zeros17707
Zeros (%)70.3%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.4476042213
Coefficient of variation (CV)1.577328788
Kurtosis-1.058006171
Mean0.2837735699
Median Absolute Deviation (MAD)0
Skewness0.9634382678
Sum7148.54
Variance0.2003495389
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01770770.3%
 
1700327.8%
 
0.5940.4%
 
0.33510.2%
 
0.25420.2%
 
0.2320.1%
 
0.05260.1%
 
0.17220.1%
 
0.04200.1%
 
0.03200.1%
 
Other values (46)1740.7%
 
ValueCountFrequency (%) 
01770770.3%
 
0.011< 0.1%
 
0.02130.1%
 
0.03200.1%
 
0.04200.1%
 
ValueCountFrequency (%) 
1700327.8%
 
0.959< 0.1%
 
0.941< 0.1%
 
0.931< 0.1%
 
0.923< 0.1%
 

rerror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct72
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1186348299
Minimum0
Maximum1
Zeros21984
Zeros (%)87.3%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3187509212
Coefficient of variation (CV)2.686824109
Kurtosis3.546871697
Mean0.1186348299
Median Absolute Deviation (MAD)0
Skewness2.346288847
Sum2988.53
Variance0.1016021497
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
02198487.3%
 
1255210.1%
 
0.9430.2%
 
0.89390.2%
 
0.91380.2%
 
0.92380.2%
 
0.95370.1%
 
0.5360.1%
 
0.93340.1%
 
0.94310.1%
 
Other values (62)3591.4%
 
ValueCountFrequency (%) 
02198487.3%
 
0.018< 0.1%
 
0.02150.1%
 
0.03210.1%
 
0.049< 0.1%
 
ValueCountFrequency (%) 
1255210.1%
 
0.996< 0.1%
 
0.982< 0.1%
 
0.977< 0.1%
 
0.9611< 0.1%
 

srv_rerror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct42
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1202651741
Minimum0
Maximum1
Zeros21958
Zeros (%)87.2%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3223408593
Coefficient of variation (CV)2.680251052
Kurtosis3.513549391
Mean0.1202651741
Median Absolute Deviation (MAD)0
Skewness2.340717599
Sum3029.6
Variance0.1039036296
MonotocityNot monotonic
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%) 
02195887.2%
 
1293711.7%
 
0.5560.2%
 
0.33320.1%
 
0.25260.1%
 
0.17170.1%
 
0.2160.1%
 
0.04140.1%
 
0.0811< 0.1%
 
0.6711< 0.1%
 
Other values (32)1130.4%
 
ValueCountFrequency (%) 
02195887.2%
 
0.029< 0.1%
 
0.038< 0.1%
 
0.04140.1%
 
0.058< 0.1%
 
ValueCountFrequency (%) 
1293711.7%
 
0.851< 0.1%
 
0.841< 0.1%
 
0.833< 0.1%
 
0.812< 0.1%
 

same_srv_rate
Real number (ℝ≥0)

ZEROS

Distinct97
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.6605454329
Minimum0
Maximum1
Zeros543
Zeros (%)2.2%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0.01
Q10.09
median1
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.91

Descriptive statistics

Standard deviation0.4396409045
Coefficient of variation (CV)0.6655725444
Kurtosis-1.611808292
Mean0.6605454329
Median Absolute Deviation (MAD)0
Skewness-0.5704244952
Sum16639.8
Variance0.1932841249
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
11535661.0%
 
0.018273.3%
 
0.027102.8%
 
0.066812.7%
 
0.036782.7%
 
0.076712.7%
 
0.046582.6%
 
0.086012.4%
 
0.055902.3%
 
05432.2%
 
Other values (87)387615.4%
 
ValueCountFrequency (%) 
05432.2%
 
0.018273.3%
 
0.027102.8%
 
0.036782.7%
 
0.046582.6%
 
ValueCountFrequency (%) 
11535661.0%
 
0.991470.6%
 
0.98180.1%
 
0.979< 0.1%
 
0.964< 0.1%
 

diff_srv_rate
Real number (ℝ≥0)

ZEROS

Distinct79
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.06236552737
Minimum0
Maximum1
Zeros15244
Zeros (%)60.5%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.06
95-th percentile0.29
Maximum1
Range1
Interquartile range (IQR)0.06

Descriptive statistics

Standard deviation0.1785531078
Coefficient of variation (CV)2.863009668
Kurtosis19.29491884
Mean0.06236552737
Median Absolute Deviation (MAD)0
Skewness4.417653605
Sum1571.05
Variance0.03188121232
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01524460.5%
 
0.06386115.3%
 
0.0719477.7%
 
0.0513505.4%
 
16632.6%
 
0.083741.5%
 
0.011940.8%
 
0.041300.5%
 
0.091210.5%
 
0.51160.5%
 
Other values (69)11914.7%
 
ValueCountFrequency (%) 
01524460.5%
 
0.011940.8%
 
0.02490.2%
 
0.03480.2%
 
0.041300.5%
 
ValueCountFrequency (%) 
16632.6%
 
0.9910< 0.1%
 
0.982< 0.1%
 
0.971< 0.1%
 
0.968< 0.1%
 

srv_diff_host_rate
Real number (ℝ≥0)

ZEROS

Distinct57
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0959346592
Minimum0
Maximum1
Zeros19516
Zeros (%)77.5%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2565872279
Coefficient of variation (CV)2.674604048
Kurtosis7.009699745
Mean0.0959346592
Median Absolute Deviation (MAD)0
Skewness2.885866409
Sum2416.69
Variance0.0658370055
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01951677.5%
 
115596.2%
 
0.015862.3%
 
0.672100.8%
 
0.51930.8%
 
0.121700.7%
 
0.331670.7%
 
0.251640.7%
 
0.021530.6%
 
0.111430.6%
 
Other values (47)23309.2%
 
ValueCountFrequency (%) 
01951677.5%
 
0.015862.3%
 
0.021530.6%
 
0.03400.2%
 
0.04390.2%
 
ValueCountFrequency (%) 
115596.2%
 
0.881< 0.1%
 
0.831< 0.1%
 
0.8140.1%
 
0.75580.2%
 

dst_host_count
Real number (ℝ≥0)

Distinct256
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean182.5333651
Minimum0
Maximum255
Zeros1
Zeros (%)< 0.1%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile3
Q184
median255
Q3255
95-th percentile255
Maximum255
Range255
Interquartile range (IQR)171

Descriptive statistics

Standard deviation98.99564787
Coefficient of variation (CV)0.5423427537
Kurtosis-1.044798262
Mean182.5333651
Median Absolute Deviation (MAD)0
Skewness-0.8431872319
Sum4598198
Variance9800.138297
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
2551485058.9%
 
16012.4%
 
25542.2%
 
32511.0%
 
42411.0%
 
51620.6%
 
61570.6%
 
81340.5%
 
101120.4%
 
111110.4%
 
Other values (246)801831.8%
 
ValueCountFrequency (%) 
01< 0.1%
 
16012.4%
 
25542.2%
 
32511.0%
 
42411.0%
 
ValueCountFrequency (%) 
2551485058.9%
 
254170.1%
 
253170.1%
 
252170.1%
 
25112< 0.1%
 

dst_host_srv_count
Real number (ℝ≥0)

Distinct256
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean115.0666111
Minimum0
Maximum255
Zeros1
Zeros (%)< 0.1%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q110
median61
Q3255
95-th percentile255
Maximum255
Range255
Interquartile range (IQR)245

Descriptive statistics

Standard deviation110.6475914
Coefficient of variation (CV)0.9615959867
Kurtosis-1.751079331
Mean115.0666111
Median Absolute Deviation (MAD)59
Skewness0.2942359952
Sum2898643
Variance12242.88949
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
255714828.4%
 
116586.6%
 
210414.1%
 
35562.2%
 
45162.0%
 
204751.9%
 
54641.8%
 
64531.8%
 
2544401.7%
 
194331.7%
 
Other values (246)1200747.7%
 
ValueCountFrequency (%) 
01< 0.1%
 
116586.6%
 
210414.1%
 
35562.2%
 
45162.0%
 
ValueCountFrequency (%) 
255714828.4%
 
2544401.7%
 
253910.4%
 
252350.1%
 
251810.3%
 

dst_host_same_srv_rate
Real number (ℝ≥0)

ZEROS

Distinct101
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5198046922
Minimum0
Maximum1
Zeros1379
Zeros (%)5.5%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.05
median0.51
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.95

Descriptive statistics

Standard deviation0.4489474187
Coefficient of variation (CV)0.8636848138
Kurtosis-1.88463729
Mean0.5198046922
Median Absolute Deviation (MAD)0.49
Skewness-0.004097744734
Sum13094.4
Variance0.2015537848
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1975838.7%
 
0.0115416.1%
 
013795.5%
 
0.0213255.3%
 
0.0711224.5%
 
0.0410464.2%
 
0.0510174.0%
 
0.037993.2%
 
0.067012.8%
 
0.085772.3%
 
Other values (91)592623.5%
 
ValueCountFrequency (%) 
013795.5%
 
0.0115416.1%
 
0.0213255.3%
 
0.037993.2%
 
0.0410464.2%
 
ValueCountFrequency (%) 
1975838.7%
 
0.991210.5%
 
0.981700.7%
 
0.971010.4%
 
0.961600.6%
 

dst_host_diff_srv_rate
Real number (ℝ≥0)

ZEROS

Distinct101
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.08254058989
Minimum0
Maximum1
Zeros9343
Zeros (%)37.1%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.03
Q30.07
95-th percentile0.56
Maximum1
Range1
Interquartile range (IQR)0.07

Descriptive statistics

Standard deviation0.1871945364
Coefficient of variation (CV)2.267908875
Kurtosis12.72682201
Mean0.08254058989
Median Absolute Deviation (MAD)0.03
Skewness3.616097637
Sum2079.28
Variance0.03504179445
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0934337.1%
 
0.07344813.7%
 
0.0619177.6%
 
0.0118817.5%
 
0.0514365.7%
 
0.0813675.4%
 
0.0213275.3%
 
0.037443.0%
 
0.046032.4%
 
0.095232.1%
 
Other values (91)260210.3%
 
ValueCountFrequency (%) 
0934337.1%
 
0.0118817.5%
 
0.0213275.3%
 
0.037443.0%
 
0.046032.4%
 
ValueCountFrequency (%) 
14081.6%
 
0.997< 0.1%
 
0.986< 0.1%
 
0.97180.1%
 
0.9612< 0.1%
 

dst_host_same_src_port_rate
Real number (ℝ≥0)

ZEROS

Distinct101
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1474518677
Minimum0
Maximum1
Zeros12673
Zeros (%)50.3%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.06
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.06

Descriptive statistics

Standard deviation0.3083726791
Coefficient of variation (CV)2.09134468
Kurtosis2.810599508
Mean0.1474518677
Median Absolute Deviation (MAD)0
Skewness2.098494475
Sum3714.46
Variance0.09509370923
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01267350.3%
 
0.01355714.1%
 
120528.1%
 
0.0211154.4%
 
0.036242.5%
 
0.044471.8%
 
0.053151.3%
 
0.52320.9%
 
0.082300.9%
 
0.062260.9%
 
Other values (91)372014.8%
 
ValueCountFrequency (%) 
01267350.3%
 
0.01355714.1%
 
0.0211154.4%
 
0.036242.5%
 
0.044471.8%
 
ValueCountFrequency (%) 
120528.1%
 
0.99190.1%
 
0.98370.1%
 
0.97320.1%
 
0.96460.2%
 

dst_host_srv_diff_host_rate
Real number (ℝ≥0)

ZEROS

Distinct63
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.03184550038
Minimum0
Maximum1
Zeros17386
Zeros (%)69.0%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.02
95-th percentile0.18
Maximum1
Range1
Interquartile range (IQR)0.02

Descriptive statistics

Standard deviation0.1105769816
Coefficient of variation (CV)3.47229531
Kurtosis36.89752202
Mean0.03184550038
Median Absolute Deviation (MAD)0
Skewness5.616948128
Sum802.22
Variance0.01222726886
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01738669.0%
 
0.0216126.4%
 
0.0114685.8%
 
0.039503.8%
 
0.048703.5%
 
0.056082.4%
 
0.53171.3%
 
0.062641.0%
 
0.072150.9%
 
0.252050.8%
 
Other values (53)12965.1%
 
ValueCountFrequency (%) 
01738669.0%
 
0.0114685.8%
 
0.0216126.4%
 
0.039503.8%
 
0.048703.5%
 
ValueCountFrequency (%) 
11320.5%
 
0.971< 0.1%
 
0.861< 0.1%
 
0.81< 0.1%
 
0.753< 0.1%
 

dst_host_serror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct100
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2858115994
Minimum0
Maximum1
Zeros16220
Zeros (%)64.4%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.4453216733
Coefficient of variation (CV)1.558095173
Kurtosis-1.061476859
Mean0.2858115994
Median Absolute Deviation (MAD)0
Skewness0.9580857646
Sum7199.88
Variance0.1983113927
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01622064.4%
 
1673926.8%
 
0.016802.7%
 
0.022370.9%
 
0.031490.6%
 
0.04830.3%
 
0.09790.3%
 
0.08690.3%
 
0.05640.3%
 
0.99590.2%
 
Other values (90)8123.2%
 
ValueCountFrequency (%) 
01622064.4%
 
0.016802.7%
 
0.022370.9%
 
0.031490.6%
 
0.04830.3%
 
ValueCountFrequency (%) 
1673926.8%
 
0.99590.2%
 
0.98360.1%
 
0.97180.1%
 
0.96200.1%
 

dst_host_srv_serror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct88
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2798574888
Minimum0
Maximum1
Zeros17004
Zeros (%)67.5%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.4460806954
Coefficient of variation (CV)1.593956615
Kurtosis-1.021843643
Mean0.2798574888
Median Absolute Deviation (MAD)0
Skewness0.9842773495
Sum7049.89
Variance0.1989879868
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01700467.5%
 
1686227.2%
 
0.017583.0%
 
0.021360.5%
 
0.03320.1%
 
0.5240.1%
 
0.12160.1%
 
0.08160.1%
 
0.04150.1%
 
0.05150.1%
 
Other values (78)3131.2%
 
ValueCountFrequency (%) 
01700467.5%
 
0.017583.0%
 
0.021360.5%
 
0.03320.1%
 
0.04150.1%
 
ValueCountFrequency (%) 
1686227.2%
 
0.986< 0.1%
 
0.97130.1%
 
0.96130.1%
 
0.956< 0.1%
 

dst_host_rerror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct101
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1178027867
Minimum0
Maximum1
Zeros20688
Zeros (%)82.1%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3058750185
Coefficient of variation (CV)2.596500703
Kurtosis3.765270354
Mean0.1178027867
Median Absolute Deviation (MAD)0
Skewness2.363640783
Sum2967.57
Variance0.09355952696
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
02068882.1%
 
120698.2%
 
0.013591.4%
 
0.022320.9%
 
0.031100.4%
 
0.05840.3%
 
0.04770.3%
 
0.92520.2%
 
0.9460.2%
 
0.91430.2%
 
Other values (91)14315.7%
 
ValueCountFrequency (%) 
02068882.1%
 
0.013591.4%
 
0.022320.9%
 
0.031100.4%
 
0.04770.3%
 
ValueCountFrequency (%) 
120698.2%
 
0.997< 0.1%
 
0.9812< 0.1%
 
0.97200.1%
 
0.96390.2%
 

dst_host_srv_rerror_rate
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct100
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1187741654
Minimum0
Maximum1
Zeros21348
Zeros (%)84.7%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3173388846
Coefficient of variation (CV)2.671783747
Kurtosis3.632762439
Mean0.1187741654
Median Absolute Deviation (MAD)0
Skewness2.360413913
Sum2992.04
Variance0.1007039677
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
02134884.7%
 
1261710.4%
 
0.012531.0%
 
0.021240.5%
 
0.03780.3%
 
0.04690.3%
 
0.05630.3%
 
0.06360.1%
 
0.99330.1%
 
0.98300.1%
 
Other values (90)5402.1%
 
ValueCountFrequency (%) 
02134884.7%
 
0.012531.0%
 
0.021240.5%
 
0.03780.3%
 
0.04690.3%
 
ValueCountFrequency (%) 
1261710.4%
 
0.99330.1%
 
0.98300.1%
 
0.97190.1%
 
0.96170.1%
 

attack
Categorical

HIGH CORRELATION

Distinct22
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size196.8 KiB
normal
13448 
neptune
8282 
ipsweep
 
710
satan
 
691
portsweep
 
587
Other values (17)
1473 
ValueCountFrequency (%) 
normal1344853.4%
 
neptune828232.9%
 
ipsweep7102.8%
 
satan6912.7%
 
portsweep5872.3%
 
smurf5292.1%
 
nmap3011.2%
 
back1960.8%
 
teardrop1880.7%
 
warezclient1810.7%
 
Other values (12)780.3%
 
Frequencies of value counts

Unique

Unique4 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length15
Median length6
Mean length6.390972967
Min length3

level
Real number (ℝ≥0)

Distinct22
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.48767417
Minimum0
Maximum21
Zeros12
Zeros (%)< 0.1%
Memory size196.8 KiB

Quantile statistics

Minimum0
5-th percentile15
Q118
median20
Q321
95-th percentile21
Maximum21
Range21
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.328584844
Coefficient of variation (CV)0.1194901364
Kurtosis13.16406373
Mean19.48767417
Median Absolute Deviation (MAD)1
Skewness-2.900198359
Sum490914
Variance5.422307377
MonotocityNot monotonic
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%) 
211249549.6%
 
18414216.4%
 
20386115.3%
 
1920478.1%
 
158023.2%
 
175872.3%
 
164691.9%
 
121620.6%
 
141490.6%
 
111260.5%
 
Other values (12)3511.4%
 
ValueCountFrequency (%) 
012< 0.1%
 
1170.1%
 
210< 0.1%
 
3140.1%
 
4140.1%
 
ValueCountFrequency (%) 
211249549.6%
 
20386115.3%
 
1920478.1%
 
18414216.4%
 
175872.3%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

durationprotocol_typeserviceflagsrc_bytesdst_byteslandwrong_fragmenturgenthotnum_failed_loginslogged_innum_compromisedroot_shellsu_attemptednum_rootnum_file_creationsnum_shellsnum_access_filesnum_outbound_cmdsis_host_loginis_guest_login_countsrv_countserror_ratesrv_serror_ratererror_ratesrv_rerror_ratesame_srv_ratediff_srv_ratesrv_diff_host_ratedst_host_countdst_host_srv_countdst_host_same_srv_ratedst_host_diff_srv_ratedst_host_same_src_port_ratedst_host_srv_diff_host_ratedst_host_serror_ratedst_host_srv_serror_ratedst_host_rerror_ratedst_host_srv_rerror_rateattacklevel
00udpotherSF146000000000000000001310.00.00.00.00.080.150.0025510.000.600.880.000.000.000.00.00normal15
10tcpprivateS000000000000000000012361.01.00.00.00.050.070.00255260.100.050.000.001.001.000.00.00neptune19
20tcphttpSF23281530000010000000000550.20.20.00.01.000.000.00302551.000.000.030.040.030.010.00.01normal21
30tcphttpSF199420000001000000000030320.00.00.00.01.000.000.092552551.000.000.000.000.000.000.00.00normal21
40tcpprivateREJ000000000000000000121190.00.01.01.00.160.060.00255190.070.070.000.000.000.001.01.00neptune21
50tcpprivateS000000000000000000016691.01.00.00.00.050.060.0025590.040.050.000.001.001.000.00.00neptune21
60tcpprivateS0000000000000000000117161.01.00.00.00.140.060.00255150.060.070.000.001.001.000.00.00neptune21
70tcpremote_jobS0000000000000000000270231.01.00.00.00.090.050.00255230.090.050.000.001.001.000.00.00neptune21
80tcpprivateS000000000000000000013381.01.00.00.00.060.060.00255130.050.060.000.001.001.000.00.00neptune21
90tcpprivateREJ000000000000000000205120.00.01.01.00.060.060.00255120.050.070.000.000.000.001.01.00neptune21

Last rows

durationprotocol_typeserviceflagsrc_bytesdst_byteslandwrong_fragmenturgenthotnum_failed_loginslogged_innum_compromisedroot_shellsu_attemptednum_rootnum_file_creationsnum_shellsnum_access_filesnum_outbound_cmdsis_host_loginis_guest_login_countsrv_countserror_ratesrv_serror_ratererror_ratesrv_rerror_ratesame_srv_ratediff_srv_ratesrv_diff_host_ratedst_host_countdst_host_srv_countdst_host_same_srv_ratedst_host_diff_srv_ratedst_host_same_src_port_ratedst_host_srv_diff_host_ratedst_host_serror_ratedst_host_srv_serror_ratedst_host_rerror_ratedst_host_srv_rerror_rateattacklevel
251810tcpotherREJ00000000000000000051110.120.000.851.00.001.000.0025510.001.000.000.000.160.00.821.0satan20
251820tcpprivateREJ00000000000000000031410.030.000.951.00.001.000.0025510.001.000.000.000.040.00.961.0satan18
2518329tcpftpSF32910630006010000000001110.000.000.000.01.000.000.00255600.240.020.000.000.000.00.030.1normal20
251841tcpsmtpSF28963330000010000000000130.000.000.000.01.000.001.0012110.920.170.080.000.000.00.000.0normal21
251850tcphttpS13391460000000100000000002330.500.030.000.01.000.000.061732551.000.000.010.010.010.00.010.0normal20
251860tcpexecRSTO00000000000000000010070.000.001.001.00.070.070.0025570.030.060.000.000.000.01.001.0neptune19
251870tcpftp_dataSF33400000010000000000110.000.000.000.01.000.000.001391.000.001.000.180.000.00.000.0warezclient12
251880tcpprivateREJ00000000000000000010570.000.001.001.00.070.070.00255130.050.070.000.000.000.01.001.0neptune21
251890tcpnnspS0000000000000000000129181.001.000.000.00.140.060.00255200.080.060.000.001.001.00.000.0neptune20
251900tcpfingerS00000000000000000003891.001.000.000.00.240.110.00255490.190.030.010.001.001.00.000.0neptune18